DE eng

Search in the Catalogues and Directories

Page: 1 2 3
Hits 1 – 20 of 52

1
Probing for the Usage of Grammatical Number ...
BASE
Show details
2
On Homophony and Rényi Entropy ...
BASE
Show details
3
On Homophony and Rényi Entropy ...
BASE
Show details
4
On Homophony and Rényi Entropy ...
BASE
Show details
5
Finding Concept-specific Biases in Form--Meaning Associations ...
BASE
Show details
6
Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models ...
BASE
Show details
7
Revisiting the Uniform Information Density Hypothesis ...
BASE
Show details
8
Revisiting the Uniform Information Density Hypothesis ...
BASE
Show details
9
Modeling the Unigram Distribution ...
BASE
Show details
10
A Bayesian Framework for Information-Theoretic Probing ...
BASE
Show details
11
A surprisal--duration trade-off across and within the world's languages ...
BASE
Show details
12
Revisiting the Uniform Information Density Hypothesis ...
BASE
Show details
13
What About the Precedent: An Information-Theoretic Analysis of Common Law ...
BASE
Show details
14
Modeling the Unigram Distribution ...
BASE
Show details
15
Finding Concept-specific Biases in Form–Meaning Associations ...
BASE
Show details
16
How (Non-)Optimal is the Lexicon? ...
Abstract: The mapping of lexical meanings to wordforms is a major feature of natural languages. While usage pressures might assign short words to frequent meanings (Zipf's law of abbreviation), the need for a productive and open-ended vocabulary, local constraints on sequences of symbols, and various other factors all shape the lexicons of the world's languages. Despite their importance in shaping lexical structure, the relative contributions of these factors have not been fully quantified. Taking a coding-theoretic view of the lexicon and making use of a novel generative statistical model, we define upper bounds for the compressibility of the lexicon under various constraints. Examining corpora from 7 typologically diverse languages, we use those upper bounds to quantify the lexicon's optimality and to explore the relative costs of major constraints on natural codes. We find that (compositional) morphology and graphotactics can sufficiently account for most of the complexity of natural codes -- as measured by code ... : Tiago Pimentel and Irene Nikkarinen contributed equally to this work. Accepted at NAACL 2021. This is the camera ready version ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2104.14279
https://arxiv.org/abs/2104.14279
BASE
Hide details
17
Disambiguatory Signals are Stronger in Word-initial Positions ...
BASE
Show details
18
Modeling the Unigram Distribution
In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (2021)
BASE
Show details
19
What About the Precedent: An Information-Theoretic Analysis of Common Law
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
BASE
Show details
20
Finding Concept-specific Biases in Form–Meaning Associations
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
BASE
Show details

Page: 1 2 3

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
52
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern